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Abstract 

Meta-theorems for polynomial (linear) kernels have been the subject of intensive 
research in parameterized complexity Heretofore, there were meta-theorems for linear 
kernels on graphs of bounded genus, ii-minor-free graphs, and .ff-topological-minor-free 
graphs. To the best of our knowledge, there are no known meta-theorems for kernels for 
any of the larger sparse graph classes: graphs of bounded expansion, locally bounded 
expansion, and nowhere dense graphs. In this paper we prove meta-theorems for 
these three graph classes. More specifically, we show that graph problems that have 
finite integer index (FII) have linear kernels on graphs of bounded expansion when 
parameterized by the size of a modulator to constant-treedepth graphs. For graphs of 
locally bounded expansion, our result yields a quadratic kernel and for nowhere dense 
graphs, a polynomial kernel. While our parameter may seem rather strong, we show 
that a linear kernel result on graphs of bounded expansion with a weaker parameter will 
necessarily fail to include some of the problems included in our framework. Moreover, 
we only require problems to have FII on graphs of constant treedepth. This allows us 
to prove linear kernels for problems such as Longest Path/Cycle, Exact s,£-Path, 
Treewidth, and Pathwidth which do not have FII in general graphs. 



*Research funded by DFG-Project RO 927/12-1 "Theoretical and Practical Aspects of Kernelization", the 
Czech Science Foundation under grant P202/ 11/0196, and the European Social Fund and the state budget of 
the Czech Republic under project CZ. 1.07/2.3.00/30.0009 (S. Ordyniak). 



1 Introduction 



Data preprocessing has always been a part of algorithm design. The last decade has seen steady 
progress in the area of kernelization, an area which deals with the design of polynomial-time 
preprocessing algorithms. These algorithms compress an input instance of a parameterized 
problem into an equivalent output instance whose size is bounded by some (small) function of 
the parameter. Parameterized complexity theory guarantees the existence of such kernels for 
problems that are fixed-parameter tractable. Of special interest are cases for which the size of 
the output instance is bounded by a polynomial (or even linear) function of the parameter, 
the so-called polynomial (or linear) kernels. 

Interest in linear kernels is not new and there have been a series of meta-theorems on linear 
kernels on sparse graph classes. A meta-theorem is a result that focuses on a problem class 
instead of an individual problem. In the area of graph algorithms, such meta-theorems usually 
have the following form: all problems that have a specific property admit an algorithm of a 
specific type on a specific graph class. The first steps towards such a meta-theorem appeared 
in a paper by Guo and Niedermeier who provided a prescription of how to design linear kernels 



on planar graphs for graph problems which satisfy a certain distance property 22 . Their work 
built on the seminal paper by Alber, Fellows, and Niedermeier who showed that Dominating 
Set has a linear kernel on planar graphs fl]. This was followed by the first true meta-theorem 
in this area by Bodlaender et al. (6) who showed that graph problems that have finite integer 
index (FIT) on bounded genus graphs and satisfy a property called quasi- compactness, admit 
linear kernels on bounded genus graphs. Shortly after |6] was published, Fomin et al. [21] 
proved a meta-theorem for linear kernels on if -minor-free graphs, a graph class that strictly 
contains graphs of bounded genus. A rough statement of their main result states that any 
graph problem that has FII, is bidimensional, and satisfies a separation property has a linear 
kernel on graphs that exclude a fixed graph as minor. This result was, in turn, generalized 
in 25 to if-topological-minor-free graphs, which strictly contain if-minor-free graphs. Here, 



the problems are required to have FII and to be treewidth- bounding. 

The keystone to all these meta-theorems is finite integer index. Roughly speaking, a 
graph problem has finite integer index if there exists a finite set S of graphs such that every 
instance of the problem can be "represented" by a member of S. This property is the basis 
of the protrusion replacement rule whereby protrusions (pieces of the input graph satisfying 
certain requirements) are replaced by members of the set S. The protrusion replacement rule 
is a crucial ingredient for proving small kernels. It is important to note that FII is an intrinsic 
property of the problem itself and is not directly related to whether it can be expressed in a 
certain logic. In particular, MSO2 expressibility does not imply FII (see (6) for sufficiency 
conditions for a problem expressible in counting MSO to have FII). As an example of this 
phenomenon, Hamiltonian Path has FII on general graphs whereas Longest Path does 
not, although both are EMSO-expressible. Another point about FII is that a problem may 
not have FII in general graphs but may do so in restricted graph classes. 

Although these meta-theorems (viewed in chronological order) steadily covered larger 
graph classes, the set of problems captured in their framework diminished as the other 
precondition(s) became stricter. For if-topological-minor-free graphs this precondition is to 
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be treewidth bounding. A graph problem is treewidth-bounding if YES-instances have a vertex 
set of size linear in the parameter, the deletion of which results in a graph of bounded treewidth. 
Such a vertex set is called a modulator to bounded treewidth. Prototypical problems that 
satisfy this condition are Feedback Vertex Set and Treewidth t- Vertex Deletion^ 
when parameterized by the solution size. A YES-instance (G, k) of Feedback Vertex Set 
satisfies this condition since any feedback vertex set with at most k vertices bounds the 
treewidth of the remaining graph by 1. Similarly, for Treewidth t- Vertex Deletion, 
any k sized solution bounds the treewidth by t. While the property of being treewidth- 
bounding appears to be a strong prerequisite to proving a meta-theorem, it is important to 
note that the combined properties of bidimensionality and separability (used to prove the 
result on if -minor- free graphs) imply that the problem is treewidth-bounding [21]. In fact, 
quasi-compactness may be viewed as a relaxation of treewidth-bounding. What this shows is 
that all meta-theorems on linear kernels for graph classes up until if-topological-minor-free 
graphs implicitly used a property akin to treewidth-boundedness. 



Another way of viewing the meta-theorem in 25 is as follows: when parameterized by a 
treewidth modulator, problems that have FII have linear kernels in if-topological-minor-free 
graphs. A natural problem therefore is to identify the least restrictive parameter that can 
be used to prove a meta-theorem for linear kernels for the next well-known class in the 
sparse-graph hierarchy, namely, graphs of bounded expansion. This class was defined by 



Nesetfil and Ossona de Mendez 29 and subsumes the class of if-topological-minor-free 
graphs. However, a modulator to bounded treewidth does not seem to be a useful parameter 
for this class. Any graph class Q can be transformed into a class Q of bounded expansion by 
replacing every graph G G Q with G, obtained in turn by replacing each edge of G by a path 
on | V(Cr)| vertices. This transformation changes neither the treewidth nor the feedback vertex 
numbers of the graphs. Hence, if a treewidth-bounding graph problem (that additionally 
has FII) has a linear kernel on graphs of bounded expansion then, in particular, Feedback 
Vertex Set and Treewidth t- Vertex Deletion have linear (vertex) kernels in general 
graphs. The best-known vertex kernel for Feedback Vertex Set in general graphs is 



quadratic 31 , for Treewidth t- Vertex Deletion in general graphs is of size k 9 ^\ where 



g is some function 20 . This strongly suggests that one would have to choose an even 
more restrictive parameter to prove a meta-theorem for linear kernels on graphs of bounded 
expansion. In particular, the parameter must not be invariant under edge subdivision. If we 
assume that the parameter does not increase for subgraphs, it must necessarily attain high 
values on paths. Treedepth [29] is precisely a parameter that enforces this property, since 
graphs of bounded treedepth are essentially degenerate graphs with no long paths. Note that 
bounded treedepth implies bounded treewidth. 

Our contribution. We show that, assuming FII, a parameterization by the size of a 
modulator to bounded treedepth allows for linear kernels in linear time on graphs of bounded 
expansion. The same parameter yields quadratic kernels in graphs of locally bounded 
expansion and polynomial kernels in nowhere dense graphs, both strictly larger classes. In 



particular, nowhere dense graphs are the largest class that may still be called sparse 29 . In 



1 For problem definitions, see Appendix. 
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these results we do not require a treedepth modulator to be supplied as part of the input, as 
we show that it can be approximated to within a constant factor. 

Furthermore, we only need FII to hold on graphs of bounded treedepth, thus including 
problems which do not have FII in general. Some problems that are included because of this 
relaxation are Longest Path/Cycle, Pathwidth and Treewidth, none of which have 
polynomial kernels with respect to their standard parameters, even on sparse graphs, since 
they admit simple AND/OR-Compositions ||5|. Problems covered by our framework include 
Hamiltonian Path/Cycle, several variants of Dominating Set, (Connected) Vertex 
Cover, Chordal Vertex Deletion, Feedback Vertex Set, Induced Matching, 
and Odd Cycle Transversal. In particular, we cover all problems included in earlier 
frameworks (6 21 25 . We wish to emphasize, however, that this paper does not subsume 



these results because of our usage of a structural parameter. 

To show that a parameterization by a treedepth modulator has merit outside the sparse- 
graph hierarchy, we extend the polynomial kernel result for Longest Path in [7] parameter- 
ized by the vertex cover number to the weaker treedepth-modulator parameter. Finally, notice 
that a kernelization result for Treewidth, Pathwidth or Longest Cycle on graphs 
of bounded expansion with a parameter closed under edge subdivision would automatically 
imply the same result for general graphs. This forms the crux of our belief that any relaxation 
of the treedepth parameter to prove a meta-theorem for linear kernels on graphs of bounded 
expansion will exclude problems akin to these three. 

We now describe how this paper is organized. The notation that we use, the main 
definitions pertaining to graph classes can all be found in Section [2j Section [3] deals with 
the notion of finite integer index and the protrusion machinery. In Section |4j we prove our 
meta-theorems for graphs of bounded expansion, locally bounded expansion, and nowhere 
dense graphs. Section [5] deals with polynomial kernels for Longest Path with the treedepth 
number as parameter. We briefly discuss the parameterized ecology program and how 
treedepth fits into this program in Section [6j We conclude in Section [7] with some open 
problems. In the appendix, we define some of the graph-theoretic problems that we mention 
in this paper. 



2 Preliminaries 



We use standard graph-theoretic notation (see 13 for any undefined terminology). All our 
graphs are finite and simple. Given a graph G, we use V(G) and E(G) to denote its vertex 
and edge sets. For convenience we assume that V(G) is a totally ordered set, and use uv 
instead of {u,v} to denote the edges of G. For X C V(G), we let G[X] denote the subgraph 
of 67 induced by X, and we define G — X := G[V(G) \ X]. Since we will mainly be concerned 
with sparse graphs in this paper, we let |67| denote the number of vertices in the graph G. The 
distance da(v,w) of two vertices v,w G V(G) is the length (number of edges) of a shortest 
v, w-path in G and oo if v and w lie in different connected components of G. The diameter 
diam(G) of a graph is the length of the longest shortest path between all pair of vertices in 
67. We denote by oo(G) the size of the largest complete subgraph of G. 



4 



The concept of neighborhood is used heavily throughout the paper. The neighborhood 
of a vertex v G V(G) is the set N G (v) = {w G V(G)\vw G E(G)}, the degree of v is 
deg G (f ) = \N G (v )|, and the closed neighborhood of v is defined as A^ G [f] := N G (v) U {v}. We 
extend this naturally to sets of vertices and subgraphs: For S C V(G) we denote N G (S) the 
set of vertices in V(G) \ S that have at least one neighbor in S, and for a subgraph H of G 
we put N G (H) = N G (V(H)). Finally if X is a subset of vertices disjoint from S, then N^(S) 
is the set N G (S) n X (and similarly for N%(H)). Given a graph G and a set W C ^(G), 
we also define <9g(W0 as the set of vertices in VF that have a neighbor in V \ W. Note that 
N G {W) = dG{V(G) \ W). A graph G is d-degenerate if every subgraph of G' C G contains a 
vertex t> G V(Cr') with deg G (v) ^ d. The degeneracy of G is the smallest d such that G is 
d-degenerate. 

In the rest of the paper we often drop the index G from all the notation if it is clear which 
graph is being referred to. 

2.1 Minors and shallow minors 

We start by defining the notion of edge contraction. Given an edge e = uv of a graph G, 
we let G/e denote the graph obtained from G by contracting the edge e, which amounts 
to deleting the endpoints of e, introducing a new vertex w uv , and making it adjacent to all 
vertices in (N(u) U N(v)) \ {u,v}. By contracting e = uv to the vertex w, we mean that 
the vertex w uv is renamed as w. Subdividing an edge is, in a sense, an opposite operation to 
contraction. A graph G is called a ^ k-subdivision of a graph H if (some) edges of H are 
replaced by paths of length at most k + 1. 

A minor of G is a graph obtained from a subgraph of G by contracting zero or more 
edges. If if is a minor of G, we write H ^ m G. A graph G is H -minor-free if H ^ m G. 

We next introduce the notion of a shallow minor. 

Definition 1 (Shallow minor 129]). For an integer d, a graph if is a shallow minor at depth d 
of G if there exists a set of disjoint subsets V\, . . . , V p of V(G) such that 

1. each graph G[Vi\ has radius at most d, meaning that there exists vi G Vi (a center) 
such that every vertex in is within distance at most d in G[VJ]; 

2. there is a bijection ip: V(H) ->■ {Vi, . . . , V^} such that for u, u G V(-ff"), w G E(H) iff 
there is an edge in G with an endpoint each in ip(u) and ip{v). 

Note that if «, v G V(if), ^(w) = VI, and ^(u) = Vj then da{vi, Vj) ^ (2d + 1) • d H {u, v). The 
class of shallow minors of G at depth d is denoted by G V d. This notation is extended to 
graph classes Q as well: §Vd = Lbee C V d. 

2.2 Parameterized problems, kernels and treewidth 

In this paper we deal with parameterized problems where the value of the parameter is not 
explicitly specified in the input instance. This situation is slightly different from the usual 
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case where the parameter is supplied with the input and a parameterized problem is defined 



as sets of tuples (x, k) as in 15 . As such, we find it convenient to adopt the definition of 
Flum and Grohe [18] and we feel that this is the approach one might have to choose when 
dealing with generalized parameters as is done in this paper. 

Let E be a finite alphabet. A parameterization of E* is a mapping k: E* — > N that is 
polynomial time computable. A parameterized problem II is a pair (Q, k) consisting of a set 
Q C E* of strings over E and a parameterization k over E*. A parameterized problem II is 
fixed-parameter tractable if there exist an algorithm A, a computable function / : N — > N 
and a polynomial p such that for all x G E*, A decides x in time f{n{x)) ■ p(\x\). 

Definition 2 (Graph problem). A graph problem II is a set of pairs (G,£), where G is a graph 
and £ G N , such that for all graphs G\, G2 and all £ G N , if G\ = G2 then (G\, £) G II iff 
(<J2,£) G II. For a graph class Q, we define Tig as the set of pairs (G, £) G II such that G G (/. 

Definition 3 (Kernelization) . A kernelization of a parameterized problem (Q,k) over the 
alphabet E is a polynomial-time computable function A: E* — >■ E* such that for all x G E*, 
we have 

1. x G Q if and only if G Q, 

2. L4(:r)K 

where g is some computable function. The function g is called the size of the kernel. If 

g(n(x)) = k(x)° ( - 1 ^ or g(n(x)) = 0(k(x)), we say that II admits a polynomial kernel and a 
linear kernel, respectively. 

Definition 4 (Treewidth). Given a graph G — (V,E), a tree- decomposition of G is an ordered 
pair (T, W), where T is a tree and W = {W 7 ^. C V \ x G V^(T)} is a collection of vertex sets 
of G, with one set for each node of the tree T such that the following hold: 

1- U xm T)W x = V(G); 

2. for every edge e = uv in G, there exists x G V(T) such that u,v G W^; 

3. for each vertex w G V(G), the set of nodes {x G V(T) | u G W^} induces a subtree. 

The vertices of the tree T are usually referred to as nodes and the sets W x are called bags. 
The width of a tree-decomposition is the size of a largest bag minus one. The treewidth of G, 
denoted tw(G), is the smallest width of a tree-decomposition of G. 

In the definition above, if we restrict T to being a path, we obtain well-known notions of 
a path-decomposition and pathwidth. We let pw(G) denote the pathwidth of G. In the sequel 
we will often implicitly use the following fact about tree decompositions (which implies that 
treewidth is a parameterization in the sense of our definition if it is bounded) . 

Proposition 1 (k3j). Given a graph G with n nodes and a constant w, it is possible to decide 
whether G has treewidth at most w, and if so, to compute an optimal tree decomposition of G 
in time 0(n). 
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2.3 Grad and graph classes of bounded expansion 

Let us recall the main definitions pertaining to the notion of graphs of bounded expansion. 
We follow the recent book by Nesetfil and Ossona de Mendez [29]. 



Definition 5 (Greatest reduced average density (grad) |26,30|). Let Q be a graph class. 
Then the greatest reduced average density of Q with rank d is defined as 

V,(S)= sup M 
Hegvd \ V{H)\ 

This notation is also used for graphs via the convention that Vd(G) := Vd({G}). In particular, 
note that G V denotes the set of subgraphs of G and hence 2Vo(G) is the maximum average 
degree of all subgraphs of G. The degeneracy of G is, therefore, exactly 2Vo(G). 

Definition 6 (Bounded expansion |26|). A graph class Q has bounded expansion if there exists 
a function /: N — > R (called the expansion function) such that for all d G N, Vd{G) ^ f{d). 

If Q is a graph class of bounded expansion with expansion function /, we say that Q has 
expansion bounded by f. An important relation we make use of later is: Vd(G) = Vo(G V d), 
i.e. the grad of G with rank d is precisely one half the maximum average degree of subgraphs 
of its depth d shallow minors. 

Another important notion that we make use of extensively is that of treedepth. In this 
context, a rooted forest is a disjoint union of rooted trees. For a vertex i in a tree T of a 
rooted forest, the height (or depth) of x in the forest is the number of vertices in the path 
from the root of T to x. The height of a rooted forest is the maximum height of a vertex of 
the forest. The closure clos(J r ) of a rooted forest T is the graph with vertex set \J T€:F V(T) 
and edge set {xy: x is an ancestor of y in J 7 }. A treedepth decomposition of a graph G is a 
rooted forest T such that G C clos(J r ). 

Definition 7 (Treedepth). The treedepth td(G) of a graph G is the minimum height of any 
treedepth decomposition of G. 

In the sequel we will often use the following fact about treedepth decompositions. 

Proposition 2 ((29]). Given a graph G with n nodes and a constant w, it is possible to decide 
whether G has treedepth at most w, and if so, to compute an optimal treedepth decomposition 
of G in time 0(n). 

We list some well-known facts about graphs of bounded treedepth. Proofs that are omitted 



and can be found in 29 . 



1. If a graph has no path with more than d vertices, then its treedepth is at most d. 

2. If td(G) ^ d, then G has no paths with 2 d vertices and, in particular, any DFS-tree 
of G has depth at most 2 d — 1. 

3. If td(G) ^ d, then G is <i-degenerate and hence has at most d ■ \V(G)\ edges. 
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> Graph 



Figure 1: The anatomy of a protrusion. 



4. If td(G) < d, then tw(G) ^ pw(G) < d - 1. 

A useful way of thinking about graphs of bounded treedepth is that they are (sparse) graphs 
with no long paths. 

For a graph G and an integer d, a modulator to treedepth d of G is a set of vertices 
M C V(G) such that td(G — M) ^ d. The size of a modulator is the cardinality of the set 
M. 

Finally, we need the following well-known result on degenerate graphs. 

Proposition 3 (|32]). Every d-degenerate graph G with n ^ d vertices has at most 2 d (n—d+l) 
cliques. 



3 The Protrusion Machinery 

In this section, we recapitulate the main ideas of the protrusion machinery developed in j6 21 

Definition 8 (r-protrusion |6|). Given a graph G, a set W C V(G) is a r -protrusion of G 
if |<9g(W)| ^ r and tw(G[W]) ^ r — 1. F] We call &g{W) the boundary and \W\ the size of 
the protrusion W . For an r-protrusion W, we call the set W — W \ do(W) the restricted 
protrusion of W. 

Thus an r-protrusion in a graph is a subgraph that is separated from the rest of the graph 
by a small boundary and, in addition, has small treewidth. See Figure [T] 

A t-boundaried graph is a graph G with a set bd{G) of t distinguished vertices labeled 1 
through t, called the boundar^ or the terminals of G. Given a graph class Q, we let Q t denote 
the class of t-boundaried graphs from Q. If W C V(G) is an r-protrusion in G, then we 
let Gw be the r-boundaried graph G[W] with boundary da(W), where the vertices of da(W) 
are assigned labels 1 through r according to their order in B. 



2 We want the bags in a tree-decomposition of G[W] to be of size at most r. 
3 Usually denoted by d(G), but this collides with our usage of d. 
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Definition 9 (Gluing and ungluing). For t-boundaried graphs G\ and G 2 , we let G\ © G 2 
denote the graph obtained by taking the disjoint union of G\ and G 2 and identifying each 
vertex in bd(Gi) with the vertex in bd(G 2 ) with the same label. This operation is called 
gluing. 

Let H C G with a boundary B of size t. The operation of ungluing H from G creates the 
t-boundaried graph GQbH := G — (V(H) \ B) with boundary B. The vertices of foi(C7 b H) 
are assigned labels 1 through t according to their order in B. 

Note that the gluing operation entails taking the union of edges both of whose endpoints 
are in the boundary with the deletion of multiple edges to keep the graph simple. The 
ungluing operation preserves the boundary (both the vertices and the edges). For the sake of 
clarity, we sometimes annotate the © and operators with the boundary as well. 

Definition 10 (Replacement). Let G be a graph with a t- protrusion W and let if be a 
t-boundaried graph. Then replacing W by H corresponds to the operation (G8s Gw) ®b H. 

We now restate the definition of one of the most important notions used in this paper. 

Definition 11 (Finite integer index; FII). Let Tig be a graph problem restricted to a class Q 
and let Gi,G 2 be two t-boundaried graphs in Q t . We say that G\ =n g> t G 2 if there exists an 
integer constant A ng t (Gi,G 2 ) (that depends on Tig, t, and the ordered pair (Gi,G 2 )) such 
that for all t-boundaried graphs G G Qt and for all (6N: 

1. G 1 © G e G iff G 2 © G e G\ 

2. (Gi © G, e Ug iff (G 2 © G,£ + A Ug>t (Gi, G 2 )) e U g . 

Note that A n „,t(Gi, G 2 ) = -A ng ,t(G 2 , Gi). In the case that (d © G, f) ^ Tig or G x © G Q 
for all G G Qt, we set Au g ,t{G\, G 2 ) = 0. We say that the problem Tig has finite integer index 
in the class Q 1 C Q if, for every integer t, there are at most g(t) equivalence classes of =u g ,t 
that contain at least one member of Q', where g is a function that depends on t, Tig and Q' '. 

Thus a problem Tig has finite integer index in the class Q' C Q iff for every integer t 
the equivalence relation =u g ,t restricted to Q 1 has finite index. If a graph problem has finite 
integer index then its instances can be reduced by "replacing protrusions". The technique 
of replacing protrusions hinges on the fact that each protrusion of "large" size can be 
replaced by a "small" gadget from the same equivalence class as the protrusion, which 
consequently behaves similarly w.r.t. the problem at hand. If G\ is replaced by a gadget G 2 , 
then £ changes by An g ,t(Gi, G 2 ). Many problems have finite integer index in general graphs 
including Vertex Cover, Independent Set, Feedback Vertex Set, Dominating 
Set, Connected Dominating Set, Edge Dominating Set. For a more complete 
list see (6 21 . Some problems that do not have finite integer index in general graphs are 



Connected Feedback Vertex Set, Longest Path and Longest Cycle. 

Our definition above is more general than the one in [To] in that we define a problem Tig 
to have finite integer index in a subclass Q' C Q rather than in the whole class Q. The main 
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reason behind this is the following. In Section |4| we restrict our inputs to graphs of bounded 
expansion but the protrusions that we replace satisfy the additional property that they have 
bounded treedepth. Our technique of replacing protrusions relies on a dynamic programming 
approach that takes one "large" protrusion of treedepth at most d and transforms it gradually 
by replacing small pieces from it (which are themselves protrusions) by still smaller sized 
representatives. We show that by systematically replacing all "large" protrusions of bounded 
treedepth we obtain a linear kernel. The property of finite integer index is used only for 
replacing protrusions which is why we require it to hold only for graphs of bounded expansion 
of treedepth at most some fixed constant. Our prototypical problem, Longest Path, does 
not have finite integer index on graph classes of bounded expansion but — as shown later — does 
so when we restrict the treedepth to be at most some fixed constant. Thus, this relaxed 
notion of FII allows us to deal with problems that do not necessarily have finite integer index 
on graphs of bounded expansion but do so when, in addition, the treedepth is bounded. 

One must, however, be careful while replacing these pieces as we have to make sure that 
whatever they are replaced with must also have treedepth at most d. The following lemma 
will be used to show that this procedure for replacing protrusions is valid. We state Lemma [T] 
and Reduction Rule [I] in a more general setting than is necessary for this paper because we 
hope that, stated in this fashion, they might be more applicable elsewhere. 

In this setting, we assume that there exists a function ip : Q — > N that maps members 
of a graph class to the integers. In our case, we use if = td. We let Q{d) denote the set of 
graphs G a Q for which ip(G) ^ d. The problems Ug that we consider are such that for all 
d G N, Ilg has finite integer index in Q(d). This means that while there can be an infinite 
number of equivalence classes of the relation =n g ,t, for each d G N, at most g(t, d) of these 
equivalence classes contain at least one graph G with <p(G) ^ d, where g is some function of 
t, d and the problem II. For each boundary size t and d G N, we let TZt,g(d) denote a set of 
graphs from Q(d) that are representatives of these equivalence classes of =n g ,t that contain 
at least one graph G with <p(G) ^ d. 

Lemma 1. Fix c,d,t G N. If H is a t-boundaried graph in Q{c- d) such that H =n g ,t H' for 
some t-boundaried graph H' in G(d), then there exists R G Tlt,g{d) such that R =n g ,t H. 

Proof. Since H =u g ,t H', the equivalence class of =u g ,t containing H contains at least one 
graph from Q(d), namely H' itself. By the definition of Rt,gu) there exists an R G Q(d) that 
is a member of Rt,g(d) with R =n g ,t H. □ 

For a graph problem II that has finite integer index in the class Q, we let pu g {t, d) denote 
the size of the largest representative in !Z t ,g(d)- Subscripts are omitted when the problem is 
clear from the context. Our reduction rule may be stated formally as follows. 

Reduction Rule 1 (Protrusion replacement). Let (G, £) G Ug and c, d, t G N be constants. 
Suppose that W C V(G) is a t-protrusion of G such that \W\ ^ 2p(t,cd) and suppose 
that <p(Gw) ^ cd, and G[W] =u g ,t H, where f(H) ^ d. Further let R G T^t,g{d) be the 
representative of H. The protrusion replacement rule is the following: 

Reduce (G, £) to (G 1 , £') := ((G Q B G w ) ®b R, £ + &n g ,t( G w, R)). 
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The next lemma shows that this rule is indeed safe. 



Proposition 4 (Safety [25] ). If(G',£') is the instance obtained from one application of the 
protrusion Reduction ruleU\to the instance (G,£) ofUg, then 

1. G' G Q; 

2. (£?',£') is a YES-instance iff(G,£) is a YES-instance. 

In what follows, unless otherwise stated, when applying protrusion replacement rules we 
will assume that for each t G N, we are given the set Tlt,g of representatives of the equivalence 
classes of =n g ,t- Note that this makes our algorithms of Section [4] non-uniform. However 
non-uniformity is implicitly assumed in previous work that used the protrusion machinery 
for designing kernelization algorithms (6 19 21 , too. 



4 Linear Kernels on Graphs of Bounded Expansion 

In this section we show that graph-theoretic problems that have finite integer index on 
fixed-treedepth subclasses of graph classes of bounded expansion admit linear kernels, where 
the parameter is the size of a modulator to constant treedepth. Recall that a treedepth-c? 
modulator in a graph G is a vertex set S C V(G) such that G — S has treedepth at most d. 

Theorem 1. Let Q be a graph class of bounded expansion and for p G N, let Q(p) C Q be the 
subclass of graphs of treedepth at most p. Let Tig be a graph problem that has finite integer 
index on Q(p) for each p G N and let d G N be a constant. Then there is an algorithm that 
takes as input (G,£) G Tig and, in time 0(\G\), outputs an equivalent instance (£?',£') such 
that \G'\ = 0(\S\), where S is an optimum treedepth-d modulator of the graph G. 

Note that we do not assume that we are given an optimal treedepth-c? modulator. We 
show that one can approximate this to within a constant. Our proof uses an approximate 
modulator S C V(G) to decompose V(G) into vertex-disjoint sets Y l±l Y\ W • • • W Yt such that 

1. S CY and |Y | = 0(\S\); 

2. for 1 ^ % ^ I, Y{ induces a collection of connected components that have exactly the 
same small neighborhood in Yq (to be defined later). 

We then use properties of graphs of bounded expansion to show that £ — 0(\S\). Finally, we 
use the protrusion replacement rule to replace each Yj by a graph of constant size. Every 
time the protrusion replacement rule is applied, £ is modified. This results in an equivalent 
instance (£?',£') such that \G'\ = 0(\S\), which is what we claim. 

First let us show that one can approximate a treedepth-c? modulator to within a constant. 

Lemma 2. Fix d G N. Given a graph G, one can in polynomial time compute a subset 

5 C V{G) such that td(G — S) ^ d and \S\ is at most 2 d times the size of an optimal 
treedepth-d modulator of G. If G is from a graph class of bounded expansion, then the same 
can be achieved in linear time. 
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Proof. We use the fact that any DFS-tree of a graph of treedepth d has depth at most 2 d — 1. 
We compute a DFS-tree of the graph G and if it has depth more than 2 d — 1, then td(G) > d. 
So, we take some path P from the root of the tree of length 2 d — 1 and add all the 2 d vertices 
of P into the modulator; delete V(P) from the graph and repeat. (Clearly, at least one of 
the vertices of P must be in any modulator.) At the end of this procedure, the DFS-tree of 
the remaining graph has depth at most 2 d — 1. This gives us a tree (path) decomposition of 
the graph of width at most 2 d — 2. Now use standard dynamic programming to obtain an 
optimum treedepth-d modulator. Since the treewidth of the remaining graph is a constant, 
the dynamic programming algorithm runs in time linear in the size of the graph. The overall 
size of the modulator has size at most 2 d times the optimal solution. 

For a graph G from a class of bounded expansion, we modify the iterated depth-first 
search. By |26|, graph classes of bounded expansion admit low treedepth coloring: Given 
any integer p, there exists an integer n p such that any graph of the class can be properly 
vertex colored using n p colors such that for any set of 1 ^ i ^ p colors, the graph induced by 
the vertices that receive these i colors has treedepth at most i. Such a coloring is called a 
p-treedepth coloring and can be computed in linear time. Here we choose p = 2 d and obtain 
such a coloring for G using n p colors. Let G\, . . . , G r denote the subgraphs induced by at 
most 2 d of these color classes where r < 2 rip = 0(1). Note that J2j \Gj\ = 0(\G\), since 
every vertex of G appears in at most a constant number of subgraphs. Any path in G of 
length 2 d — 1 must be in some subgraph Gj, for 1 ^ j ^ r. For each subgraph Gj, we simply 
construct a treedepth decomposition, find all paths of length 2 d — 1, add their vertices into 
the solution and delete them from the graph. The time taken to do this for each subgraph Gj 
is 0(\Gj\). The total time taken is therefore J2j \Gj\ = 0(\G\). □ 

We will make heavy use of the following lemma to prove the kernel size. 

Lemma 3. Let G = (X, Y, E) be a bipartite graph. Then there are at most 

1. 2Vi((j) • \X\ vertices in Y with degree greater than 2Vi(G); 

2. (4 Vl ( G ) + 2Vi(G)) ■ \X\ subsets X' C X such that X' = N(u) for some ueY. 

Proof. We construct a sequence of graphs Gq,Gi, . . . ,Ge such that Gi G G V 1 for all ^ i ^ £ 
as follows. Set Go = G, and for ^ % ^ £ — 1 construct Gi+i from Gi by choosing a vertex 
v G V(Gi) \ X such that N(v) C X contains two non-adjacent vertices u, w in Gf, if no such 
vertex v exists, stop with £ := i. Set e i+ i = uv and contract this edge to the vertex u to 
obtain G i+ \. Recall that contracting uv to u is equivalent to deleting vertex v and adding 
edges between each vertex in N(v) \ u and u. It is clear from the construction that for 
< i < £, X C V(Gi) C X U Y. 

This process clearly terminates, as Gi + \ has at least one more edge between vertices of X 
than Gi. Note that Gj G G V 1 for ^ i ^ £, as the edges ex, ... , that were contracted to 
vertices in X in order to construct Gj had one endpoint each in X and Y, the endpoint in Y 
being deleted after each contraction. Thus, e±, . . . , ej_i induce a set of stars in V(G) = V(Gq), 
and Gi is obtained from G by contracting these stars. We therefore conclude that Gj is a 
depth-one shallow minor of G. In particular, this implies G^[X] is 2Vi(G)-degenerate and 
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has at most 2Vi(G) • \X\ edges. Further, note that for each ^ i ^ £, Y R V(Gi) is, by 
construction, still an independent set in Gi. 

Let us now prove the first claim. To this end, assume that there is a vertex v G Y n V(Gi) 
such that degree(v) > 2Vi(G). We claim that Gg[N(v)] (where N(v) C X) is a clique. If 
not, we could choose a pair of non-adjacent vertices in Gf[N(v)] and construct a (i + l)-th 
graph for the sequence which would contradict the fact that Gg is the last graph of the 
sequence. However, a clique of size \{v} U N(v)\ > 2V±(G) + 1 is not 2Vi(G)-degenerate. 
Hence we conclude that no vertex of Y H V(Ge) has degree larger than 2V\(G) in Ge (and in 
G). Therefore the vertices of Y of degree greater than 2Vi(G) in the graph G, if there were 
any, must have been deleted during the edge contractions that resulted in the graph Ge. As 
every contraction added at least one edge between vertices in X and since G^[X] contains at 
most 2Vi(G) • \X\ edges, the first claim follows. 

For the second claim, consider the set Y' — Y D V(Ge). The neighbourhood of every 
vertex v G Y' induces a clique in G^fX]. From the degeneracy of G^LY], it follows that G^[X] 
has at most 2 2Vl ^|G^[X]| = 4 Vl ^ ■ \X\ cliques. Thus the number of subsets of X that are 
neighbourhoods of vertices in Y in G is at most (4 Vl ( G ) + 2Vi(G)) ■ \X\, where we accounted 
for vertices of Y lost via contractions by the bound on the number of edges in G^[X]. □ 

The following two corollaries to Lemma [3] show how it can be applied in our situation. 

Corollary 1. Let Q be a graph-class whose expansion is bounded by a function f: N — > R. 
Suppose that for G G Q and S C V(G), Ci,...,C s are disjoint connected subgraphs of 
G — S satisfying the following two conditions: for 1 ^ i ^ s, diam(G[V (Q)]) ^ 5 and 
\N s (Ci)\ > 2- f(5+l). Then s ^2- f(6 + 1) ■ \S\. 

Proof. We construct an auxilliary bipartite graph G with partite sets S and Y = {C±, . . . ,C S }. 
There is an edge between Cj and x G S iff x G Ng{Cj). Note that G is a depth-£ shallow 
minor of G with branch sets Cj, 1 ^ i ^ s. By Lemma [3j 

s < 2V 1 (G)|S'| ^2V 1 (Gv5)\S\ =2V S+1 (G)\S\ ^2f(5 + l)\S\. □ 

Corollary 2. Let Q be a graph-class whose expansion is bounded by a function f : N — > R. 
Suppose that for G G Q and S C V(G), C\,...,Ct are sets of connected components of 
G — S such that for all C,C' G \Ji.d it holds that C,C' G Cj for some j if and only if 
Ng(C) = Ns(C'). Let 5 > be a bound on the diameter of the components, i.e. for all 
C G UiCj, diam(G[V{C)\) ^ 5. Then there can be only at most t < (4^ +1 ) + 2f(5 + 1)) • |5| 
such sets C{. 

Proof. As in the proof of Corollary [TJ we construct a bipartite graph G with partite sets S 
and Y = {Ci, . . . , C r }, where the vertices Cj represent connected components in \J { Ci and 
Cj has an edge to x G S iff x G Ng(Cj). As before, G is a shallow minor at depth 5 of G 
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with branch sets Cj, 1 ^ j ^ r. By Lemma [3j 

t < C S \ 3d e Y : N(Ci) = S'}\ < (4 Vl(6) + 2Vi(G)) ■ |5| 

< (4 Vl(Gv5) + 2Vi(Gv5))- |5| 

= ( 4 V, +1 (G) + 2Vm(G)) .| 5 | 

<; ( 4 /(«+i) +2/(5 + 1)) ■ □ 



Algorithm 1: Bag marking algorithm 
Input: A graph G, a subset S C V(G) such that td(G — S) ^ d, and an integer t > 0. 

Set A4 <— as the set of marked bags; 

for each connected component C of G — S such that Ng(C) ^ t do 

Choose an arbitrary vertex v G V(C) as a root and construct a DFS-tree starting 
at v; 

Use the DFS-tree to obtain a path-decomposition Pc = (Pc, Be) of width at most 
2 d — 2 in which the bags are ordered from left to right; 

Repeat the following loop for the path-decomposition Vc of every C; 
while Vc contains an unprocessed bag do 

Let B be the leftmost unprocessed bag of Vc'-, 

Let Gb denote the subgraph of G induced by the vertices in the bag B and in all 
bags to the left of it in Vc- 

[Large-subgraph marking step] 

if Gb contains a connected component Cb such that \Ns(Cb)\ ^ t then 
|_ M <- M U {B} and remove the vertices of B from every bag of Vc] 

Bag B is now processed; 

return Y = SUV(M); 



Lemma 4. Let Q be a graph class with expansion bounded by f , G G Q and S C V(G) be a 
set of vertices such that td(G — S) ^ d (d a constant). There is an algorithm that runs in 
time 0(\G\) and partitions V(G) into sets Yq t±J Y\ t±J • • • l+l Ye such that the following hold: 

1. S C Y and \Y Q \ = 0(\S\); 

2. for 1 ^ i ^ I, Yi induces a set of connected components of G — Yq that have the same 
neighborhood in Yq of size at most 2 d+1 + 2 • f(2 d ); 

3. (4^ + 2/(2 d )) -\S\ = 0(\S\). 
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Proof. We first construct a DFS-forest J 7 of G — S. Assume that there are q trees T\, . . . , T q 
in this forest that are rooted at n, . . . , r q , respectively. Since td(G — S) ^ d, the height of 
every tree in T is at most 2 d — 1. Next we construct for each Tj, where 1 ^ i ^ q, a path 
decomposition of the subgraph of G induced by the vertices in Tj. Suppose that Tj has leaves 
li, . . . ,l s ordered according to their DFS-number. For 1 ^ j ^ s, create a bag Bj containing 
the vertices on the unique path from lj to and string these bags together in the order 
£>i, . . . , B s . It is easy to verify that this is indeed a path decomposition V{ of (j[V(Tj)], that 
each bag has at most 2 d — 1 vertices and that the root is in every bag of the decomposition. 

We now use a marking algorithm similar to the one in [25] to mark 0(15*1) bags in the path 
decompositions V\, . . . , V q with the property that each marked bag can be uniquely identified 
with a connected subgraph of G — S that has a large neighborhood in the modulator S. This 
algorithm is described in Figure [T] in which we set t, the size of a large neighborhood in S, to 
be t := 2 ■ f(2 d ) + 1. Note that there is a one-to-one correspondence between marked bags A4 
and connected subgraphs with a neighborhood of size at least t in S. Moreover each connected 
subgraph has treedepth at most d and hence diameter at most 2 d — 1. By Corollary [IJ the 
number of connected subgraphs of large neighborhood and hence the number of marked bags 
is at most 2 • f(2 d - 1 + 1) ■ \S\ = 2f(2 d ) ■ \S\ = 0(\S\). We set Y : = V(M) U S. 

Now observe that each connected component in G — Yq has less than t = 2 ■ f(2 d ) + 1 
neighbors in S. This follows because for every connected subgraph C with at least t neighbors 
in S, there exists a marked bag B. Importantly, the bag B was the first bag that was marked 
before the number of neighbors in S of any connected subgraph reached the threshold t. 
Hence each connected component of G[V(C) \ B] has degree less than t in S. Since every 
component can be connected to at most two marked bags (in Yq) and since each bag is of 
size at most 2 d — 1, the size of the neighborhood of every component of G — Yq in Yq is at 
most 2(2 d - 1) + 1 < 2 d+1 + 2 ■ f(2 d ). 

To complete the proof, we simply cluster the connected components of G — Y according 
to their neighborhoods in Y to obtain the sets Yi, . . . , Yg. Since each connected component 
of G — S is of diameter 5 ^ 2 d — 1, by Corollary p2| the number i of clusters is at most 



To prove a linear kernel, all that is left to show is that each cluster Y{, 1 ^ i ^ £, can be 
reduced to constant size. Note that each cluster is separated from the rest of the graph via a 
small set of vertices in S and that each component of G — S has constant treedepth. These 
facts enable us to use the protrusion reduction rule. 

In the proof of the following lemma it will be convenient to use the following normal form 
of tree decompositions: A triple (T,{W X \ x G V(T)},r) is a nice tree decomposition of a 
graph G if (T, {W x \ x G V(T)}) is a tree decomposition of G, the tree T is rooted at node 
r G V(T), and each node of T is of one of the following four types: 

1. a leaf node: a node having no children and containing exactly one vertex in its bag; 

2. a join node: a node x having exactly two children yi, y 2 , and W x = W yi = W y2 ; 

3. an introduce node: a node x having exactly one child y, and W x = W y U {v} for a vertex 
v of G with v G" W y 




□ 
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4. a forget node: a node x having exactly one child y, and W x = W y \ {v} for a vertex v 
of G with v G W y . 

Given a tree decomposition of a graph G of width u>, one can effectively obtain in time 
C?(|V(Cr)|) a nice tree decomposition of G with nodes and of width at most w [9]. 

In the context of the next lemma, let Q be a graph class of bounded expansion and, for 
p G N, let £?(p) denote the subclass of £ of graphs of treedepth at most p. Let Ug be a 
graph problem that has finite integer index on Q{p) for every fixed p G N. Recall that p(t, d) 
denotes the size of the largest representative in Ht,g(d), f° r t ne problem Ug. 

Lemma 5. For fixed d, h G N, let (G, £) 6e an instance of Ug and let S C 6e a 

treedepth-d modulator of G. Let Y l±l Yi l±l - • • l±l Y% be a protrusion- decomposition of G, 
where S C Y and for 1 ^ i ^ i, |iVy (Yj)| ^ /i. Then one can in 0(\G\) time obtain an 
equivalent instance (G",£') and a protrusion- decomposition Yq l±l Y[ l±l • • • l±l F/ o/ G' where 
KJ = K 0; and forl^i^e it is \N Y >(Y/)\ <: h and \Y?\ < p(d + h, d) = O(l). 

Proof. Since 5* C y is a treedepth-d modulator, for all 1 ^ i ^ £, we have td(G[Yj]) ^ d and 
hence tw(G[K;]) ^ d — 1. Moreover treedepth at most d implies diameter at most 2 d — 1 for 
each component. For each index 1 ^ i ^ £, our algorithm constructs a tree-decomposition of 
G[Y{ U iV(Yj)] of width d + h that satisfies certain properties that we mention below. The 
algorithm then uses this tree-decomposition to replace Y^ in a systematic manner using the 
protrusion replacement rule. The properties that this tree-decomposition satisfies enable 
the algorithm to perform this replacement in U iV(Yj)|) time. The total time taken 

to replace all sets Y { is Ef=i \Yi U N(Y$\ and since by Lemma § Ef=i |^V(ii)| = 0(\Y Q \), 
the running time is indeed (9(|G|). It therefore suffices to describe what properties our 
tree-decompositions satisfy and how each Yj is replaced. 

The tree-decomposition % = (T u {W x \ x G V(T$\) of width d + h for G, := G{Y i UN{Y i )} 
satisfies the following conditions: 

1. there is a node r G V{Ti) such that N(Yj) = W r ; 

2. the tree-decomposition is nice and the leaf bags contain one vertex. 

The first condition can be achieved by simply modifying the graph Gi so that N(Yi) induces a 
clique, and then introducing an extra node r if no such node exists. The decomposition 71 is 
rooted at the node r. For x G V(Ti), we let G x denote the (d + /i)-boundaried graph induced 
by the vertices in the bags of the subtree of Tj rooted at x. That is, 

G x = G[\JW y ], 

where the union is over all y G V(Ti) that are descendants of x and bd(G x ) = W x . For x G 
V(Ti), denote by A(x) the representative of G x in 7ld+h,g{d) and let = An e ,rf+ft(A(a;) 1 G x ). 
Note that the treedepth of G x is at most d and since Tig has FII in Q(d), such a representative 
A(a;) is indeed well founded. Moreover, |A(x)| ^ M where M := p(d + h,d) denotes the size 
of the largest representative in 1Zd+h,g(d)- 
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In order to replace Yj, it is sufficient to know A(r) and /x(r) which we will calculate in a 
bottom-up manner in (9(|Y|) time as follows. If y G V(Tj) is a leaf node then these values 
can be computed in constant time. Let x G V(Tj) be a node with exactly one child ?/ whose A 
and /i values are known. Consider the (d + /i)-boundaried graph G' x := (G x ®w y G y ) ®w y A(y) 
with bd(G' x ) = W x . We claim that G' x = Ug d+h G x . To prove this, we need to demonstrate 
that for all graphs G and all £ G N, 

(G; ©w* G, G U g if and only if (G x ® Wx G, £ + //) G 

where yl = A Ug4+h (G' x ,G x ). Now 

(G; ® Wx G, G n g iff ((G x ® Wy G y ) ® Wy A{y)) ®w x G, G U g 
iff ((G x ® Wx G) e Wy G y ) ® Wy A(y),0 G U g 
iff ((G x ® Wx G) Q Wy G y ) ® Wy G y , £ + n(y)) G Ug, 

where the last step follows because of A(y) =xi g ,d+h G y . Since {G x ®w x G) ®w y G y ) ®w y G y 
is just the graph G x ®w x G, this proves our claim. In fact, // = y(y). 

Observe that G' x is of constant size, bounded from above by M+ \W X \ ^ M + d+h = 0(1). 
Although A(y) has treedepth at most d, G' x is not guaranteed to have treedepth at most d. In 
fact, G' x can have treedepth up to d + h. However since td^^) ^ d, we can use Lemma [T] to 
conclude that there exists R G 7ld+h,g(d) with G' x =u g ,d+h R, and obtain this R in constant 
time since G' x is of constant size. We set A(x) = R and y{x) = y(y) + An !d+h (G' x , R). Note 
that the total time spent at node x to generate these values is a constant. 

Finally consider the case when x G V(Tj) has exactly two children y\ and y<i whose A 
and \i values are known. Since our tree-decomposition is nice, we have W yi = W x = W y2 
and therefore bd(G yi ) = bd{G y2 ) = W x . Consider the (d + /i)-boundaried graph G" x = 
A(yi) ®w x A(y 2 ) with bd(G x ) = W x . Similarly as in the above case, we demonstrate that for 
all graphs G and all £ G N, 

(G x ® Wx G, G Ug if and only if (G x ® Wx G, £ + n") G U g , where y" = y( Vl ) + y(y 2 ). 

Then G" x has size at most 2M which is a constant. One can therefore, again in constant 
time, calculate a representative R G Hd+h,Q{d) of G' x . Set A{x) = R and y(x) = Ajj^+hiG", R). 
This shows that one can in time obtain A(r) and y(r), as desired. □ 

With the help of this last lemma we can now prove the main theorem of this section. 

Proof of Theorem^ Given an instance of II with G G Q for a graph class Q with 

expansion bounded by /: N — y R and having fixed a constant d G N, we calculate a 2 d - 
approximation S of a minimal treedepth-ci-modulator using Lemma [2] Then, we use the 
above Algorithm [T] to obtain the decomposition Y tfcl Y\ tfcl • • - l±l Yg as defined in Lemma [4] 
Each cluster Yj, 1 ^ i ^ £ forms a protrusion with boundary size |A^(Yj)| ^ 2 d+1 + 2f(2 d ) =: h 
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and treedepth (and thus treewidth) at most d. Applying the protrusion reduction rule to 
each individual cluster as in Lemma [5] then yields an equivalent instance (£?',£') with 

\V(G')\ = |*o| + £*7 ^0{\S\)+i-p{d + 2 d+1 + 2/(2 d ), d) = 0(\S\) ■ 0(1) = 0(\S\) 

i=l 

where Y( denote the clusters obtained through applications of the reduction rule. As G is 
degenerate, the above bound implies that |V(G')| + \E{G') \ = 0(\S\), too. □ 

Several graph problems have finite integer index on the class of all graphs and thus admit 
linear kernels on graphs of bounded expansion if parameterized by a treedepth modulator. 

Corollary 3. The following graph problems have finite integer index, and hence have linear 
kernels in graphs of bounded expansion, when the parameter is the size of a modulator to 
constant treedepth: Dominating Set, t-Dominating Set, Efficient Dominating Set, 
Connected Dominating Set, Vertex Cover, Hamiltonian Path, Hamiltonian 
Cycle, Connected Vertex Cover, Independent Set, Feedback Vertex Set, 
Edge Dominating Set, Induced Matching, Chordal Vertex Deletion, Odd 
Cycle Transveral, Induced g?-Degree Subgraph, Min Leaf Spanning Tree, Max 
Full Degree Spanning Tree. 

For a more comprehensive list of problems that have FII in general graphs (and hence fall 
under the purview of the above corollary), see [6]. 

Some problems do not have FII in general (see jl2|) but only when restricted to graphs of 
bounded treedepth, and for those we have the same conclusion in the following: 

Lemma 6. Let Q be any graph class and Q{d) be those graphs of Q that have treedepth at 
mostd. The problems Longest Path, Longest Cycle, Exact s,£-Path, Exact Cycle 
restricted to Q have FII in Q(d) C £ for any d G N. 

Proof. Let II be any one of the mentioned problems restricted to Q, and let d,t be constants. 
Consider the class Q t of t-boundaried graphs over Q, and let T = {0, 1, . . . , t}. 
We define a configuration of II with respect to Q t as a mutiset 

C = {(si,di,ti), . . . , (sp } d p ,t p )} 

of triples from T x N x T. We say a t-boundaried graph G G Qt satisfies the configuration 
C if there exists a set of (distinct) paths Pi, . . . , P p in G such that 

• Si,ti can only be endvertices of Pi, V(Pi) Hbd(G) C {sj,£j}, and \P,i\ = di, for 1 ^ i ^ p, 

• V(Pi) n V(Pj) C bd(G) for 1 < i < j < p, 

• V(Pi) n V(Pj) n V(P fc ) = for 1 < % < j < k < p. 
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Note that, for simplicity, we identify the boundary vertices in bd(G) with their labels 1, . . . , t 
from T. Moreover, Si,ti can take the value which is not contained in bd{G): semantically 
these tuples describe paths which intersect the boundary of G at only one or no vertex. 
Another special case are tuples with Sj = U and d — 0: those describe single vertices of the 
boundary. In short, a graph satisfies a configuration if it contains internally non-intersecting 
paths of length and endvertices prescribed by the tuples of the configuration, and no three 
of the paths are prescribed to have the same endvertex (hence some configurations are not 
satisfiable at all, but this is a small technicality). 

The signature cr[G] of a graph G G Qt is a function from the configurations into {0, 1} 
where a[G](C) — 1 iff G satisfies C. We claim that the equivalence relation ~ CT defined via 

Gi ——{j G 2 cr[Gi] = a[G 2 ] for G u G 2 G Qt 

is a refinement of ee h t . We provide only a sketch for II = Longest Path, the proofs for 
the other problems work analogous. To this end we assume the contrary, that <j[G\\ = a[G 2 ] 
while Gi ^n,* G 2 . Up to symmetry, this means that for all integers c there exists a graph 
G 3 G Q t such that (Gi © G 3 , £) G II but (G 2 © G 3 , £ + c) G" II. We choose c = and show the 
contradiction. Thus the graph Gi © G 3 contains a path P of length I but G 2 © G 3 does not. 

Using the implicit order given through the vertex order of P we sort the subpaths of P 
contained in Pfl G\ and so obtain a sequence of paths Pi, . . . , P q C Gi, each with at most two 
vertices - the ends, in bd{G\). By identifying each subpath Pj with the tuple (s,, rfj,tj) where 
= |Pj| and Sj is the label of the start of P; in bd{G\) (or if Sj G" bd{G\)) and tj the label of 
the end of Pj in bd{G\) (ditto), we obtain a configuration Cp = {(s 1 , d-y, ti), . . . , (s 9 , d g , i 9 )}. 
Now, Gi satisfies Gp by the definition. Since a[Gi](Cp) = a[G 2 ](Gp), there exists a set of 
paths Qi, . . . , Q q C G 2 witnessing that G 2 satisfies Gp. But then Qi, . . . ,Q g together with 
P H G 3 form a path Q of length £ in G 2 © G 3 , a contradiction. 

Second, although ~ CT is generally of infinite index, we claim that for every t, only a 
finite number of equivalence classes of ~ a carry a representative from Qt(d) - the subclass 
of treedepth at most d. This is rather easy since graphs of treedepth ^ d do not contain 
paths of length 2 d — 1 or longer, and so a graph G G £/t(cf) can satisfy a configuration 
G = {(si,di,ti), . . . , (s p , d p , t p )} only if di G {0, 1, . . . , 2 d — 2} for 1 ^ % ^ p. Recall, each 
boundary vertex label occurs at most twice among s±, t±, . . . , s p , t p in a satisfiable configuration. 
Hence only finitely many such configurations G can be satisfied by a graph from Qt{d), and 
consequently, finitely many function values of cr[G] are nonzero for any G G <?t(d) and the 
number of the nonempty classes of ~ CT restricted to Q t (d) is finite. □ 

Lemma 7. Let Q be any graph class and Q(w) be those graphs of Q that have treewidth at 
most w. The problems Treewidth and Pathwidth restricted to Q have FII in Q(w) C Q 
for any w G N. 

Proof. Let II = Treewidth (the proof works analogously for Pathwidth) restricted to 
Q, and let w,t be constants. Consider the class Q t of t-boundaried graphs over Q, and let 
U = {1,2,...,*}. We again, for simplicity, identify the boundary vertices in a graph from Qt 
with their labels 1, . . . , t from U. 
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We mimic the proof of Lemma [6] with some changes. We define a configuration of II 
wrt. Q t as a set C = {{X X , Wi), . . . , (X p ,w p )} of pairs such that Xi C 2 17 and w; 6 N for 
% — 1, . . . ,p. We say a t-boundaried graph G a Gt satisfies the configuration C if there exists 
a collection of induced subgraphs Hi, ... , H p of G such that 

• V(Hi) n V(#i) C 6d(G) for 1 < i < j < p, and #i U . . . U H p = G, 

• there exists a tree decomposition (Tj, Wi), z = 1, 2, . . . ,p, of the graph Hi of width at 
most Wi, 

• each X 6 ^ is a bag in this decomposition, i.e., X G Wj. 

The signature o~[G] of a graph G 6 ^ is a function from the configurations into {0, 1} 
where <r[G](G) = 1 iff G satisfies C. We claim that the equivalence relation ~ CT defined via 

G x ——(j G 2 = a[G 2 ] for G 1; G 2 G & 

is a refinement of =n,t- To this end we assume the contrary, that o~[Gx] = <x[G 2 ] while 
G\ ^u,t Gi- Up to symmetry, this means that for all integers c there exists a graph 
G 3 G & such that (G ± © G 3 , k) G II but (G 2 © G 3 , fc + c) G" II. We choose c = and show the 
contradiction. Thus the graph Gi©G , 3 has a tree decomposition (T, W) of width k but G^ffiG^ 
does not. We will set B = bd(Gi) and assume for simplicity that B = bd(Gz) = bd(Gs) = U, 
i.e., B C Gi © G 3 as well as I? C G 2 © G 3 . As i? is a vertex-separator of Gi © G 3 , we can 
assume that no bag in W contains both vertices from G X \B and from G^\B. We will further 
assume that each bag in W appears exactly once in the tree decomposition, that every subset 
X C B which is contained in some bag also exists exclusively as a bag X G W, and that for 
no adjacent bags their union contains both vertices from G\\B and from G^\B (all three 
conditions can easily be enforced without increasing the width of the decomposition). 

We color the nodes of T with colors white, black and red according to the following 
criterion: every x G V(T) is assigned the color c(x), where c(x) is 

• red if W x C B, and otherwise 

• white if W x C V{Gi) and black if W x C V(G 3 ). 

The above conditions on the structure of (T, W) now imply that c partitions the nodes V(T) 
into T w hite,Tbiack,T re d, and that no white node is adjacent to a black node in T. 

From this coloring we create a collection of subtrees T% , . . . , T q - the connected components 
of T — T b i ack . Let Hi, i = 1, . . . , q, be the subgraph of G\ induced by Uneven) W x , and let 
(Tj, Wi) denote the corresponding tree decomposition of Hi. We denote by Wi the width of 
(Tj, Wj) and set = {W^ G Wj : x G ^(rrerf)}. Now, the subgraphs Hi, . . . ,H q witness that 
the graph Gi satisfies the configuration Ct = {(Xi, wi), . . . , (X q , w q )} by definition. 

Since <t[Gi](Ct) = o~[G2](Ct), there exists a collection of induced subgraphs H[, . . . , H' 
of G*2, and their tree decompositions {T[, W(), . . . , (TL W') witnessing that also G 2 satisfies 
Ct (particularly with the same widths w x , . . . ,w q , respectively). Moreover, for each x G 
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V(T red ), the bag W' x G W- (for the appropriate % such that W x G Wj above) is the same as 
W' x = W x G Won the boundary B. We make T' as the union, by identification of nodes in 
V(T red ), of T — V{T w hite) with T[ U . . . U T^, and set W' to be the union of W restricted to 
the nodes of Tf,i ac k U T red with W{ U . . . U W^. But then (T', W") is a tree decomposition of 
width k in G 2 © G 3 , a contradiction. 

Second, although ^ is generally of infinite index, we claim that for every t, only a 
finite number of equivalence classes of ~ CT carry a representative from (?t(u>) - the sub- 
class of treewidth at most w. For this we claim that a graph G of treewidth ^ w can 
satisfy a configuration C = {(Xi, w±), . . . , (X p , w p )} only if G satisfies also the configuration 
{(Xi,w[), . . . , (X p , w'p)} where w[ = min(wj, w + t) for 1 ^ i ^ p. To see this, notice that 
one can take a tree decomposition of whole G restricted to witness subgraphs Hi (notation 
as above) and add suitable subsets of the boundary to (some) bags, to form the witness 
tree decomposition for (A^w-). Moreover, p ^ 2 2 as every combination of subsets of the 
boundary can appear at most once. Therefore, finiteness of ~ CT restricted to Qt{w) follows as 
at the end of Lemma [6] □ 

Corollary 4. The problems Longest Path, Longest Cycle, Exact s,t-PATH, Exact 
Path, Treewidth, and Pathwidth have linear kernels in graphs of bounded expansion 
with the size of a modulator to constant treedepth as the parameter. 



4.1 Extension to larger graph classes 

We can extend our result to classes of graphs of locally bounded expansion and furthermore 
to graphs that are nowhere dense. 

Definition 12 (Locally bounded expansion [16]). A graph class Q has locally bounded 
expansion if there exists a function / : N x N — > R (called the expansion function) such 
that for every graph G G Q and all r, d G N and every vertex v G V(G), it holds that 
V r (G[N d (v)})^f(d,r). 



Definition 13 (Nowhere dense 27 28]). A graph class Q is nowhere dense if for all r G N it 
holds that u(^Vr) < oo. 

In the above definition we use the natural extension of oo to classes of graphs via u(Q) = 
sup Gg g ou(G). Note that both graph classes are closed under taking shallow minors in the 
sense that Q V r has locally bounded expansion (is nowhere dense) if Q has locally bounded 
expansion (is nowhere dense), albeit with a different expansion function (a different bound 
on the clique size of r-shallow minors). 

We claim the following two kernelization result for the above classes, which in particular 
apply to all problems listed in Section |4j 

Theorem 2. Let Q be a graph class of locally bounded expansion and for p G N, let Q{p) C Q 
be the subclass of graphs of treedepth at most p. Let Tig be a graph problem that has finite 
integer index on Q{p) for all p G N and let d G N be a constant. Then there is an algorithm 
that takes as input (G,£) G Tig and, in polynomial time, outputs an equivalent instance ((?',£') 
such that \G'\ = 0(\S\ 2 ), where S is an optimum treedepth-d modulator of the graph G. 
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Theorem 3. Let Q be a nowhere- dense graph class for p G N, let G(p) C Q be the subclass 
of graphs of treedepth at most p. Let Ug be a graph problem that has finite integer index 
on Q(p) for all p G N and let d G N be a constant. Then there is an algorithm that takes 
as input (G,£) G Ug and, in polynomial time, outputs an equivalent instance (£?',£') such 
that \G'\ = 0(\S\ C ) for some constant c, where S is an optimum treedepth-d modulator of the 
graph G. 

The proofs of Theorems [2] and [3] follow analogously to the proof of Theorem [T] using 
Lemma [8] (see below) in place of Lemma [3j We need additional notation. Let #u(G) 
denote the number of complete subgraphs of G. For a graph class Q and an integer i we let 
Q^i := {H G Q | \H\ ^ £} denote those graphs of Q which have at most i vertices. 

Definition 14 (Greatest reduced average clique density). For a graph G and integer r we 
define U r {G) — max/f g G' Vr (^a;(i/)/|i/|) to be the greatest reduced clique density (clique- 
grad) with rank r of G. For a graph class Q the clique expansion with rank r is defined as 

□r(0) = SUp Geg n r (G). 

Lemma 8. Let G = (X, Y, E) be a bipartite graph let Hx = (Gv 1)<|X|- Then there are at 
most 

1. 2Vo("Hx) '\X\ vertices in Y with degree larger than w(7ix); 

2. (p (H x ) + 2Vo("Hx)) • \X\ subsets X' C X such that X' = N{u) for some ueY. 

Proof. We construct a sequence of graphs Go, G\, . . . , Ge analogous to the proof of Lemma [3] 
Note that, by construction, we have that GJX] G Hx f° r 1 ^ i ^ £■ In particular, this 
implies that G^[X] has at most 2Vo("Hx) • |^| edges. 

Let us now prove the first claim. To this end, assume that there is a vertex v G Y H V{Gg) 
such that deg(f ) > cj('Hx)- We claim that G([N{y)] (where N(v) C X) is a clique. If not, 
we could choose a pair of non-adjacent vertices in Gi[N(v)] and construct a (£ + l)-th graph 
for the sequence which would contradict the fact that Gt is the last graph of the sequence. 
However, the set N(v) then induces a clique of size larger than u(Hx), a contradiction. 

Hence we conclude that no vertex of Y fl V(Ge) has degree > u(T-L x ) in Gi (and thus 
in G). Therefore the vertices of Y of degree > uj(T-Lx) in the graph G, if there were any, 
must have been deleted during the edge contractions that resulted in the graph Ge. As every 
contraction added at least one edge between vertices in X and since G^[X] contains at most 
2Vo("Hx) • \X\ edges, the first claim follows. 

For the second claim, consider the set Y' = Y n V(Ge). As observed above, the neighbor- 
hood of every vertex v G Y' induces a clique in Ge[X\. The number such sets therefore can be 
upper bounded by the number of cliques in G^[X], which in turn can be bounded as follows: 

#u(G t [x\) = n (G e [x})\x\ < □„(((? v iU\ X \)\x\ = n {H x )\x\ 

In total then the number of subsets of X that are neighborhoods of vertices in Y in G is at 
most (□o("Hx) + 2Vo("Hx))|AT|, where we accounted for vertices of Y lost via contractions by 
the bound on the number of edges in G^LY]. □ 
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The following two corollaries are analogs of Corollary [T] and [2] and will be used in a similar 
fashion. 



Corollary 5. Let Q be a graph-class. Suppose that for G G Q and S C V(G), C\, . . . , C s 
are disjoint connected subgraphs of G — S satisfying the following two conditions: for 1 ^ 
i < s, diam(G[V(Ci)}) < 5 and \N s (Ci)\ > u(H s ) where U s = (Gv(5 + Then 
s^2V (Hs)-\S\. 

Proof. We construct an auxiliary bipartite graph G with partite sets S and Y = {C±, . . . , C s }. 
There is an edge between Cj and x G S iff x G Ns(Ci). Note that G is a shallow minor at 
depth 5 of G by the assumption, and therefore (G V l)^|s| Q Us- By Lemma 8 



s^2Vo((Gvlks|)|5 , |^2V (^ s )|5 , |. 



□ 



Corollary 6. Let Q be a graph-class. Suppose that for G G Q and S C V((j), Ci, . . . ,C t are 

sets of connected components of G — S such that for all C, C' G U« Ci it holds that C, C' G Cj 
for some j if and only if Ns(C) = Ns(C'). Let 5 > be a bound on the diameter of the 
components, i.e. for all C G \JiCi, diam(G[V(C)]) ^ 5. Then there can be only at most 
t ^ (□q('Hs) + 2Vo("HsO) • \S\ such sets Ci where again %s — {Gv{5 + 



Proof. As in the proof of Corollary |5j we construct a bipartite graph G with partite sets S 
and Y = {Ci, . . . , C r }, where the vertices Cj represent connected components in IJiC; an d 
Cj has an edge to x G S iff x G Ng(Cj). As before, G is a shallow minor at depth 5 of G and 
therefore (Gv ^ By Lemma [IJ 

< < I {5" c 5 1 ac, g y : N(Ci) = s'}\ 

< (Do((G V 1)^,) + 2V ((G v Ik,*,)) • |5| 
^(□o(H 5 ) + 2Vo(H 5 ))-|5|. 

□ 

Note that, using the notation of Lemma [8| we have the trivial bounds 2Vo('Hx) ^ \X\ 
and □o('Hjf) ^ |-X"| < "( Wx )~ 1 . For graphs of locally bounded expansion this second bound can 
be improved as follows. 

Lemma 9. Let Q be a graph class with local expansion bounded by f: N x N — > R. Then for 
any graph G G Q, any constant c and any integer ^ t ^ |V(G)|, \3 (^(G V c)^J ^ 4f( 1+c >°)£, 

Proof. Consider any H G (G\7c)^. Note that H G (G V c)^£ C G V c, and thus if has local 
expansion bounded by f'(d, r) = f(d + c,r). 

We upper-bound the cliques in H iteratively as follows: pick a vertex v, count all cliques 
that contain v and add those to the number of cliques in H — v. Now, all cliques that 
contain a fixed vertex v must be contained in N[v]. As G[A[f]] is a radius-one subgraph 
of H, it has bounded expansion with expansion function f'(l,r) = /(l + c, r) and thus is 
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2/(1 + c, 0)-degenerate. We can now apply the result of (32] , stating that every <i-degenerate 
graph G with n ^ d vertices has at most 2 d {n — d + 1) cliques. Doing so we see that G[iV[t>]] 
contains at most 2 2/(1+c,0) |iV[t>]| < 4/( 1 + c >°)|# | < 4 /(1+c,0) £ cliques. Iterating this counting 
over all vertices of H then yields a generous bound of 4-^ 1+c,0 )£ 2 and therefore we obtain the 
desired bound for the clique density through division by £. □ 

The following generalization of Lemma [4] follows easily using the above two corollaries. 

Lemma 10. Let Q be a graph class, G G Q and S C V(G) be a set of vertices such that 
td(G — S) ^ d (d a constant). Let Us = (G V 2 d )^\s\. Ifu(7is) is a constant, then there is 
an algorithm that runs in time linear in \G\ and partitions V(G) into sets Yq l±J Y\ l±l • • • l+J Yi 
such that the following hold: 

1. S C Y and \Y \ < 2Vq(U s ) ■ \S\; 

2. for 1 ^ % ^ I, Yi induces a set of connected components of G — Yq that have the same 
neighborhood in Y of size at most U}(7is); 

3. £^(p (H s ) + 2V (H s ))-\S\. 

Proof. We proceed exactly as in the proof of Lemma [4] using t := uj^Hs) an d the bounds 
from Corollary [5] and [6] □ 

We are now ready to prove the two theorems. 



Proof of Theorem [j| Analogously to the proof of Theorem [T] we use Lemma 10 to obtain 
a protrusion-decomposition Yq l±l Yi W • • • W Yi in place of Lemma [4] Let G be a graph 
from a class of locally bounded expansion and let d be an integer and S C V(G) be a 
treedepth-c? modulator of G. It is left to show that for %$ — (G\/2 d )^\ S \ the bounds of 



Lemma 10 are indeed quadratic in \S\. Clearly, Vq(G) ^ \G\, thus Vq(Hs) ^ \S\ and 
therefore |Y"o| — @{\S\ 2 )- The bound □o('Hs) = 0(15*1) was proved in Lemma [9] and therefore 
£ ^ (n (H s ) + 2V (Hs))\S\ = 0(\S\ 2 ) and the claim follows. □ 



Proof of Theorem^ Analogously to the proof of Theorem [T] we use Lemma 10 to obtain a 
protrusion-decomposition Yq l±J Y\ l±l ■ ■ ■ t±J Ye in place of Lemma [4] Let G be a graph from a 
nowhere-dense graph class and let d be an integer and S C V(G) a treedepth-c? modulator 
of G. It is left to show that for — {Gv2 d )^\s\ the bounds of Lemma 10 are indeed 
polynomial in \S\. Clearly, V (G) ^ \G\, thus V CH 5 ) < \S\ and therefore \YqJ= 0(\S\ 2 ). 

For □o(Hs) we use the trivial bound of □o('Hs) ^ ^S^^ 8 ' 1 , so it is left to show that 
uj(Hs) is a constant. As Hs Q G V 2 d and per definition of nowhere-dense graph classes, 
oj(G V r) < 00 for every constant r, the claim follows. □ 
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5 Structural Parameterizations of Longest Path 



In this section we show that the problem Longest Path has a polynomial kernel when 
parameterized by a modulator to constant treedepth. Our result almost entirely closes the 
gap between the polynomial kernel of Longest Path when parameterized by the size of a 
vertex cover and the no polynomial kernel result for Longest Path when parameterized by 
the size of a modulator to pathwidth two [7] . 

It is well-known that Longest Path can be solved in linear time if the treewidth of the 
input graph is bounded by some constant |31. Because of the relationship between treewidth 
and treedepth (see Section [2]) this result carries over to treedepth. 

Proposition 5. Longest Path can be solved in linear time if the treedepth of the input 
graph is bounded by some constant. 

The following lemma is at the very core of our result. 

Lemma 11. For fixed d G N, d ^ 1, let S C V(G) be a treedepth-d modulator of a graph G 
and let k = \S\. Then there is an induced subgraph G' of G and a set S' C V(G') such that: 
(1) G and G' are equivalent instances of LONGEST Path (for the same path length), (2) G' 
and S 1 can be computed from G and S in time Oik 2 ■ \V(G)\), and (3) S' is a treedepth-(d — 1) 
modulator of G' of size \S'\ ^ {k + l) 3 . 

Proof. Let U be the family of vertex sets of all connected components of G — S. Since for 
each U the graph G[U] has treedepth at most d, there exists rjj G U (the root of some 
treedepth d decomposition) such that G[U — {ru}] has treedepth d — 1. Therefore if we 
can find in time 0(k 2 ■ \ V(G)\) a subfamily W C U of size at most {k + l) 3 — k such that 
G' = G[S U Uuew U] is an equivalent instance of Longest Path, the claim of the lemma 
follows. To see this, notice that we can use vertices ru, one for each U G W, together with 
vertices from S to form treedepth- (d — 1) modulator S' of G' . The modulator 5" will therefore 
consist of k vertices from S and at most (k + l) 3 — k new vertices, one from each component 
of W, and so \S'\ ^ (k + l) 3 , as claimed. 

In the rest of the proof, we show that we can find the family W with the aforementioned 
properties in desired time. 

Assume \U\ > k + 1. For all U G U and x,y G S with x ^ y we denote 

i. by hP(U) a longest path in the graph G[U] (we choose any one if not unique), and by 
Uq G U a representative achieving maximum value |LP(f/ )| over 14; 

ii. by LP(x, U) a longest path starting from x in the graph U U], and by WjCWa 
subfamily of \U X \ = k + 1 ("top k + 1 representatives" by |LP(x, U)\) such that for any 
Ut G U x , U 2 G U\U X it is |LP(x, U x )\ ^ |LP(x, U 2 )\; 

iii. by LP(x, y, U) a longest path between x and y in the graph y}i)U], or LP(x, y,U) — 
if no such path exists, and analogously by U x , y ^Wa subfamily of \U XjV \ — k + 1 ("top 
k + 1 representatives" by \LP(x,y, U)\) such that for any U\ G U X} y, G U \ U x>y it is 
\l&(x,y,U x )\>\VP(x,y,U 2 )\. 
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Because td(G[U}) sC td(G[{x} U U]) ^ td(G[{x,y} U C7]) < d + 2 (a constant), it follows 
from Proposition [5] that LP(Z7), LP(x, U), LP(x,y, U) can each be computed in linear time, 
and hence the whole computation of Uo,U x ,U X) y can be done in Oik 2 ■ |V(G)|) time. 

We claim that the family W = {Uq} U \J X £gU x U \] xy ^ Sx ^ y U. x ^ together with S induces 
graph G' which satisfies the conclusion of the lemma. Clearly, \W\ ^ (^j(k + l) + k(k + l) + l = 
\k{k + l) 2 + 1 ^ (k + l) 3 . It remains to show that if G has a path of length at least £ then 
so does G' = G[SU{Juew U}. 

Let P be a path of length at least £ in G and let q = \V(P) fl S\ ^ k. Then S "cuts" 
P into q + 1 sections, i.e., we can write P = Pq U Pi U . . . U P q where Pj, i — 0, . . . , q are 
mutually edge-disjoint paths disjoint from S except possibly at their ends. Suppose that 
P <2 G' . There are three cases to consider for the subpaths P,: 

I. q — and P = P . Then the length of P is at most |LP(Z7o)| by the definition, and 
hence we can choose P' := LP(Z7o) C G 1 straight away. 

II. q ^ 1 and Pq % G' or P q % G' . Consider, without loss of generality, the latter case 
Pg % G' and let {x} = V(P q ) n S. Then the length of P g is at most |LP(z, U)\ for any 
U G IA X by the definition. Notice that each of the q ^ k paths Pj, i — 0, . . . , q — 1, can 
intersect only at most one component from U by connectivity (and P g is disjoint from 
all of U x ). Hence, at least k + 1 — q ^ 1 component(s) in U x , say Ui, is disjoint from 
whole P. Then in P we replace P q with LP (a;, C/i). 

III. q > 1 and P g G" where < i < g. Let {x, y} = V(Pi) fl 5. Then the length of P is 
at most |LP(x, y, U)\ for any [/ e by the definition. For the same reason as above 
there exists a component U2 € Z/^y not intersected by P, and we then in P replace Pj 
with LP(x,y,U 2 ). 

Repeating II, III for all sections of P, the resulting path P 1 C G" has length at least |P| ^ £, 
and this concludes the proof of the lemma. □ 

Theorem 4. Let d G N be a constant, and let the function g be defined as follows; g(0, k) = k 
and g(i, k) = g(i — 1, (k + l) 3 ) . Then Longest Path has a polynomial kernel of size at 
most g(d, k) parameterized by the size k of a modulator to treedepth d where, asymptotically, 
g(d, k) = 0(k 3 ). This kernel is computable in time 0(k 2 ■ \V(G)\). 

Proof. Let G be a graph, and S C V(G) a treedepth-<i modulator of G. We proceed by 



induction on d ^ 0: For d = we necessarily have S = V(G) (cf. Lemma 11) and hence 



immediately a kernel of size k = g(0,k). For d > 0, we apply Lemma 11 to obtain an 
equivalent instance G' with modulator 5" of size k' = \S'\ ^ (k + l) 3 . Then G' can be 
kernelized to an instance of size at most g(d — l,k') by the inductive assumption, and 
g(d — 1, k') ^ g(d, k) as desired. □ 
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6 On the Ecology of Structural Parameters 



A primary goal of parameterized complexity is to study how different parameters affect 
the complexity of classical problems. In particular, one aims to discover the boundaries 
of tractability by finding the weakest parameterization for which a problem is in FPT or 
admits polynomial kernels. It also provides further insight into what exactly does make hard 
problems hard. This study of how different parameters influence fixed-parameter tractability 
or polynomial kernelizability is referred to as parameterized ecology [17]. A surge in the 
interest of parameterized ecology has helped to make headway in the parameterized ecology 
program (see [8 23 24]). 



In the quest for polynomial kernels one often has to consider two types of structural 
restrictions: restrictions on the input instances and restrictions on the parameters. Some 
problems are way too difficult in general to be tractable. For instance, Dominating Set is 
W[2]-complete and Independent Set is W[l]-complete, but both problems admit linear 
kernels on planar graphs jlj. This raises the question of whether these problems are tractable 
in more general graph classes under stronger parameterization^] such as e.g. the vertex cover 
number. It turns out that both these problems are indeed in FPT when parameterized 
by the vertex cover number. Other problems exist that seem to be much harder, such as 
Dominating Set, which does not admit a polynomial kernel even when parameterized by 
the solution size and the vertex cover number [14]. It is clear that if we wish to identify the 
boundary of polynomial kernelizability for problems that are as difficult as Dominating 
Set, we must necessarily restrict ourselves to special graph classes. 

Another illustrative example is Longest Path. The standard parameterized version 
of this problem is in FPT in general graphs but has no polynomial kernel [2j[5]. When 
parameterized by the size of a vertex cover, it admits a quadratic kernel [7]. This leads us 
to the question as to whether there exist parameters weaker than vertex cover for which 
Longest Path has a polynomial kernel. One possibility is to use the treewidth as the 
parameter. But as was implicitly shown in [7], Longest Path does not admit a polynomial 
kernel even when parameterized by a modulator to a graph of pathwidth two. 

In fact, we conjecture that Longest Path does not have a polynomial kernel in general 
graphs with respect to a modulator to a single path. Therefore if we want a polynomial 
kernel for this problem with respect to a parameter that modulates some graph property, 
then it seems that this property must not admit long paths. This is one reason why we chose 
to parameterize problems by the size of a modulator to bounded treedepth, as graphs of 
bounded treedepth have a bound on the longest path. In this case, we do indeed have a 
polynomial kernel, as was shown in Section [5] If we restrict the input instances by requiring 
that they are members of a graph class of bounded expansion then we obtain a linear kernel. 
On general graphs, the degree of the polynomial is a function of the treedepth. It is an 
interesting question whether this dependency can be removed. 



4 We use adjectives such as "strong" in the sense that they impose a greater structural restriction on the 
input instance. As a case in point, the vertex cover number of a graph is a stronger parameter than say, the 
feedback vertex number, since in the former case the "rest of the graph" is an independent set, whereas in 
the latter case it is a forest. 
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At first glance, a modulator to bounded treedepth seems to be a severely restricting param- 
eter. But note that a vertex cover is a modulator to a treedepth-1 graph and hence our param- 
eter is certainly less restrictive than the vertex cover number. If our conjecture for Longest 
Path holds, this is essentially the best parameter that we can hope for if we want a polynomial 
kernel. But the connection between treedepth and graphs of bounded expansion is deeper. 
Dvorak and Krai showed that for 
any graph class Q of bounded ex- 
pansion and any positive integer p, 
there exists q G N such that ev- 
ery graph G Q has a vertex col- 
oring with q colors such that for 
any i color classes, 1 ^ % ^ p, in- 
duce a subgraph of treedepth at 
most i [26]. That is, any G G Q 
can be partitioned into a constant 
number of subgraphs each of con- 
stant treedepth. 

The existence of a polynomial 
kernel is not the only relevant 
question. It is desirable for the 
kernel to be as small as possible. 
For problems with FII on graphs 
of bounded treedepth, we have 
shown the existence of a linear 
kernel on graphs of bounded ex- 
pansion, which is obviously the 
best we can hope for. But is there 
a weaker parameter that still al- 
lows one to obtain linear kernels 
on graphs of bounded expansion 
for the same set of problems? One 
possibility is to use the size of a 
modulator to bounded treewidth 
as parameter. This is not likely 
to yield linear kernels for the fol- 
lowing reason: Firstly, any graph 
class Q can be transformed into 
a graph class Q of bounded ex- 
pansion by replacing every G G Q 
by G obtained by replacing each 

edge in G by a path on |V(G)| vertices. It is easy to verify that the operation of subdividing 
edges does not change the treewidth. Now let us consider TREEWiDTH-t Vertex Deletion 
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Figure 2: Overview of known kernelization results for selected 
problems on sparse graph classes. The gray fields highlight results 
from this paper. 
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c Assuming that d > t 



28 



which is the prototypical problem parameterized by a modulator to constant treewidth. An 
input to this problem consists of a graph G and parameter k. The question is whether there 
exist at most k vertices whose deletion from G results in a subgraph of treewidth at most t. 
Two special cases of this problem are Vertex Cover, where t — 0, and Feedback Vertex 



Set, where t = 1. It is well-known that Vertex Cover has an O(k) vertex-kernel 11 
and that the best-known kernel for Feedback Vertex Set has 0(k 2 ) vertices 31 , both 



of which hold for general graphs. Fomin et al. |20] showed that TREEWlDTH-t Vertex 
Deletion admits a kernel of size in general graphs. Improving this result to a kernel 
of size g(t) ■ k ^ has proven to be a significant challenge. If we manage to obtain a linear 
kernel on graphs of bounded expansion using a modulator to a bounded treewidth graph as 
parameter, then it would directly follow that TREEWiDTH-i Vertex Deletion has a linear 
kernel in general graphs. This seems too good to be true. 

Finally, many purely decision problems, such as Hamiltonian Path / Cycle and 3- 
Colorability, which have no natural parameter, are covered by our framework. It was 
already shown in 25 that these problems have a linear kernel on if-topological-minor-free 
graphs when parameterized by a modulator to bounded treewidth for ^/-topological minor 
free graphs. Taking a modulator to treedepth allowed us to extend this result to the class of 
bounded expansion graphs by choosing a modulator to bounded treedepth as a parameter. 



7 Conclusions and Further Research 

In this paper we presented kernelization results on graphs of bounded expansion, locally 
bounded expansion, and nowhere dense graphs. To the best of our knowledge, these are the 
very first kernelization results on these graph classes. The parameter that we use is the size 
of a modulator to constant treedepth graphs. Evidence suggests that any meta-theorem on 
linear kernels on graphs of bounded expansion that includes all the problems in Corollary [3] 
necessarily requires a parameter that cannot be weaker than what we have. However for 
problems whose solution sizes are not invariant under edge subdivisions, such as Dominating 
Set and Hamiltonian Cycle, it might be possible to obtain such a result. 

There are some interesting open questions regarding the polynomial kernelizability of 
Longest Path. We conjecture that Longest Path has no polynomial kernel in general 
graphs with the size of a modulator to a single path (of arbitrary length) as parameter. This 
would show that if we use the size of a modulator to a (subgraph closed) graph property as 
parameter, then in general graphs there exists a dichotomy for Longest Path: If the graph 
property excludes long paths, there is a polynomial kernel; otherwise not. The polynomial 
kernel presented here has size k 9 ^ d \ where k is the size of a treedepth-c? modulator and 
g(d) = 3 d . Is there a kernel of size g(d) ■ k°^\ for some function gl 
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8 Appendix 



In this appendix, we define some of the problems that we mention in this paper. 



Longest Path 

Input: A graph G and a positive integer £. 

Problem: Does G contain a simple path of length at least £1 



Longest Cycle 

Input: A graph G and a positive integer £. 

Problem: Does G contain a simple cycle of length at least £1 



Exact 


s,t-PATH 




Input: 


A graph G, two special vertices s,t e V(G) and 


a positive integer £. 


Problem: 


Is there a simple path in G from s to t of length 


exactly £1 



Exact Cycle 

Input: A graph G and a positive integer £. 

Problem: Is there a simple cycle in G of length exactly il 



Feedback Vertex Set 

Input: A graph G and a positive integer £. 

Problem: Is there a vertex set S C with at most £ vertices such that G — S 

is a forest? 



Treewidth 

Input: A graph G and a positive integer £. 

Problem: Is the treewidth of G at most £? 
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Pathwidth 

Input: A graph G and a positive integer £. 

Problem: Is the pathwidth of G at most t! 



TREEWiDTH-t Vertex Deletion 
Input: A graph G and a positive integer £. 

Problem: Is there a vertex set S C with at most £ vertices such that the 

treewidth of G — S is at most t? 



Dominating Set 

Input: A graph G = (V, E) and a positive integer I. 

Problem: Is there a vertex set SCI/ with at most I vertices such that for all 
u G V \ S there exists v G S such that m> G £7 



If in addition, we require that G[S] is a connected graph then the problem is called 
Connected Dominating Set. 



r- Dominating Set 

Input: A graph G = (V, E) and a positive integer £. 

Problem: Is there a vertex set S C. V with at most I vertices such that for all 
u G V \ S there exists v G S such that d(u, v) ^ r? 



Efficient Dominating Set 

Input: A graph G = (V, E) and a positive integer £. 

Problem: Is there an independent set S C V with at most £ vertices such that for 
every u G V \ S there exists exactly one v G S such that uv E E? 



Edge Dominating Set 

Input: A graph G = (V, i?) and a positive integer £. 

Problem: Is there an edge set S C £ of size at most £ such that for every e G E\S 
there exists e' G S such that e and e' share an endpoint? 
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Induced Matching 

Input: A graph G = (V, E) and a positive integer £. 

Problem: Is there an edge set S C E of size at least £ such that S is a matching 
and for all u,v E V(S), ii uv E E then uv £ S7 



Chordal Vertex Deletion 

Input: A graph G = (V, E) and a positive integer £. 

Problem: Is there a vertex set S C of size at most £ such that G — S is chordal? 
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